Overview
Dataset statistics
| Number of variables | 9 |
|---|---|
| Number of observations | 11497554 |
| Missing cells | 862 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 789.5 MiB |
| Average record size in memory | 72.0 B |
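The memory figures in this table can be cross-checked with a few lines of arithmetic. The sketch below uses only numbers reported here; the 8-byte-per-cell figure is an assumption (it would fit columns stored as object pointers, with string payloads not counted), not something the report states.

```python
# Figures copied from the "Dataset statistics" table.
n_rows = 11_497_554
n_columns = 9
total_mib = 789.5

MIB = 1024 ** 2  # bytes per mebibyte

# Average bytes per record implied by the reported total size.
avg_record_bytes = total_mib * MIB / n_rows

# Assumption: each cell costs 8 bytes (e.g. an object pointer),
# so one column of n_rows cells takes n_rows * 8 bytes.
bytes_per_cell = 8
per_column_mib = n_rows * bytes_per_cell / MIB

print(f"average record size: {avg_record_bytes:.1f} B")
print(f"size of one column:  {per_column_mib:.1f} MiB")
print(f"columns x cell size: {n_columns * bytes_per_cell} B")
```

Under that assumption, the results line up with the reported 72.0 B per record and with the 87.7 MiB "Memory size" listed for each variable.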
Variable types
| Text | 7 |
|---|---|
| Categorical | 2 |
Reproduction
| Analysis started | 2025-03-06 14:07:41.634718 |
|---|---|
| Analysis finished | 2025-03-06 14:23:33.653208 |
| Duration | 15 minutes and 52.02 seconds |
| Software version | ydata-profiling v4.13.0 |
Variables
tconst
Text
Unique 
| Distinct | 11497554 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 87.7 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.5100721 |
| Min length | 9 |
Unique
| Unique | 11497554 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | tt0000001 |
|---|---|
| 2nd row | tt0000002 |
| 3rd row | tt0000003 |
| 4th row | tt0000004 |
| 5th row | tt0000005 |
| Value | Count | Frequency (%) |
|---|---|---|
| tt0000003 | 1 | < 0.1% |
| tt9916880 | 1 | < 0.1% |
| tt9916824 | 1 | < 0.1% |
| tt9916826 | 1 | < 0.1% |
| tt9916830 | 1 | < 0.1% |
| tt9916832 | 1 | < 0.1% |
| tt9916834 | 1 | < 0.1% |
| tt9916836 | 1 | < 0.1% |
| tt9916838 | 1 | < 0.1% |
| tt9916840 | 1 | < 0.1% |
| Other values (11497544) | 11497544 | > 99.9% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| t | 22995108 | 21.0% |
| 1 | 11178421 | 10.2% |
| 2 | 10595346 | 9.7% |
| 0 | 9382735 | 8.6% |
| 4 | 8732037 | 8.0% |
| 3 | 8662308 | 7.9% |
| 8 | 8403278 | 7.7% |
| 6 | 8375926 | 7.7% |
| 5 | 7263202 | 6.6% |
| 7 | 6942274 | 6.3% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 109342567 | 100.0% |
titleType
Categorical
Imbalance 
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 87.7 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 9 |
| Mean length | 8.2459255 |
| Min length | 5 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | short |
|---|---|
| 2nd row | short |
| 3rd row | short |
| 4th row | short |
| 5th row | short |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| tvEpisode | 8842179 | 76.9% |
| short | 1047689 | 9.1% |
| movie | 708125 | 6.2% |
| video | 306986 | 2.7% |
| tvSeries | 277985 | 2.4% |
| tvMovie | 150130 | 1.3% |
| tvMiniSeries | 60199 | 0.5% |
| tvSpecial | 51496 | 0.4% |
| videoGame | 42183 | 0.4% |
| tvShort | 10581 | 0.1% |
Length
Histogram of lengths of the category
| Value | Count | Frequency (%) |
|---|---|---|
| tvepisode | 8842179 | 76.9% |
| short | 1047689 | 9.1% |
| movie | 708125 | 6.2% |
| video | 306986 | 2.7% |
| tvseries | 277985 | 2.4% |
| tvmovie | 150130 | 1.3% |
| tvminiseries | 60199 | 0.5% |
| tvspecial | 51496 | 0.4% |
| videogame | 42183 | 0.4% |
| tvshort | 10581 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| o | 11107874 | 11.7% |
| e | 10819650 | 11.4% |
| v | 10599995 | 11.2% |
| i | 10559682 | 11.1% |
| t | 10450842 | 11.0% |
| s | 10228052 | 10.8% |
| d | 9191348 | 9.7% |
| p | 8893675 | 9.4% |
| E | 8842179 | 9.3% |
| r | 1396454 | 1.5% |
| Other values (10) | 2718223 | 2.9% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 94807974 | 100.0% |
primaryTitle
Text
| Distinct | 5169977 |
|---|---|
| Distinct (%) | 45.0% |
| Missing | 19 |
| Missing (%) | < 0.1% |
| Memory size | 87.7 MiB |
Length
| Max length | 458 |
|---|---|
| Median length | 405 |
| Mean length | 19.86731 |
| Min length | 1 |
Unique
| Unique | 4713837 |
|---|---|
| Unique (%) | 41.0% |
Sample
| 1st row | Carmencita |
|---|---|
| 2nd row | Le clown et ses chiens |
| 3rd row | Poor Pierrot |
| 4th row | Un bon bock |
| 5th row | Blacksmith Scene |
| Value | Count | Frequency (%) |
|---|---|---|
| episode | 4829829 | 12.7% |
| the | 1176720 | 3.1% |
| dated | 940908 | 2.5% |
| | 459714 | 1.2% |
| of | 404569 | 1.1% |
| a | 321853 | 0.8% |
| and | 254404 | 0.7% |
| in | 232228 | 0.6% |
| to | 190171 | 0.5% |
| 2 | 150513 | 0.4% |
| Other values (1413980) | 29071058 | 76.4% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 26533623 | 11.6% |
| e | 20063100 | 8.8% |
| i | 13155102 | 5.8% |
| o | 12971854 | 5.7% |
| a | 11418055 | 5.0% |
| s | 11183018 | 4.9% |
| d | 10121912 | 4.4% |
| r | 8223486 | 3.6% |
| t | 8136502 | 3.6% |
| n | 8110022 | 3.6% |
| Other values (193) | 98508414 | 43.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 228425088 | 100.0% |
originalTitle
Text
| Distinct | 5194979 |
|---|---|
| Distinct (%) | 45.2% |
| Missing | 19 |
| Missing (%) | < 0.1% |
| Memory size | 87.7 MiB |
Length
| Max length | 458 |
|---|---|
| Median length | 405 |
| Mean length | 19.864735 |
| Min length | 1 |
Unique
| Unique | 4739045 |
|---|---|
| Unique (%) | 41.2% |
Sample
| 1st row | Carmencita |
|---|---|
| 2nd row | Le clown et ses chiens |
| 3rd row | Pauvre Pierrot |
| 4th row | Un bon bock |
| 5th row | Blacksmith Scene |
| Value | Count | Frequency (%) |
|---|---|---|
| episode | 4829765 | 12.7% |
| the | 1124106 | 3.0% |
| dated | 940907 | 2.5% |
| | 460722 | 1.2% |
| of | 383798 | 1.0% |
| a | 313525 | 0.8% |
| and | 247633 | 0.7% |
| in | 226191 | 0.6% |
| to | 187797 | 0.5% |
| de | 151446 | 0.4% |
| Other values (1450862) | 29142850 | 76.7% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| (space) | 26511220 | 11.6% |
| e | 20007205 | 8.8% |
| i | 13203158 | 5.8% |
| o | 12963151 | 5.7% |
| a | 11499813 | 5.0% |
| s | 11188948 | 4.9% |
| d | 10124757 | 4.4% |
| r | 8199996 | 3.6% |
| n | 8137510 | 3.6% |
| t | 8099495 | 3.5% |
| Other values (180) | 98460234 | 43.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 228395487 | 100.0% |
isAdult
Categorical
Imbalance 
| Distinct | 45 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 87.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 1 |
| Mean length | 1.0002148 |
| Min length | 1 |
Unique
| Unique | 9 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 11125251 | 96.8% |
| 1 | 371479 | 3.2% |
| 1978 | 130 | < 0.1% |
| 1985 | 83 | < 0.1% |
| 1980 | 66 | < 0.1% |
| 1979 | 63 | < 0.1% |
| 1984 | 41 | < 0.1% |
| 1974 | 33 | < 0.1% |
| 1982 | 32 | < 0.1% |
| 1972 | 29 | < 0.1% |
| Other values (35) | 347 | < 0.1% |
Length
Histogram of lengths of the category
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 11125451 | 96.7% |
| 1 | 372300 | 3.2% |
| 9 | 780 | < 0.1% |
| 8 | 469 | < 0.1% |
| 7 | 392 | < 0.1% |
| 2 | 208 | < 0.1% |
| 6 | 146 | < 0.1% |
| 5 | 133 | < 0.1% |
| 4 | 85 | < 0.1% |
| 3 | 58 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 11500024 | 100.0% |
startYear
Text
| Distinct | 152 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 87.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.7521087 |
| Min length | 2 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1894 |
|---|---|
| 2nd row | 1892 |
| 3rd row | 1892 |
| 4th row | 1892 |
| 5th row | 1893 |
| Value | Count | Frequency (%) |
|---|---|---|
| n | 1425072 | 12.4% |
| 2021 | 507756 | 4.4% |
| 2022 | 490371 | 4.3% |
| 2018 | 457557 | 4.0% |
| 2023 | 454467 | 4.0% |
| 2019 | 453971 | 3.9% |
| 2017 | 451182 | 3.9% |
| 2020 | 435636 | 3.8% |
| 2016 | 426996 | 3.7% |
| 2015 | 401774 | 3.5% |
| Other values (142) | 5992772 | 52.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 11306873 | 26.2% |
| 0 | 10422710 | 24.2% |
| 1 | 7298469 | 16.9% |
| 9 | 3997798 | 9.3% |
| \ | 1425072 | 3.3% |
| N | 1425072 | 3.3% |
| 8 | 1406997 | 3.3% |
| 7 | 1300422 | 3.0% |
| 3 | 1189585 | 2.8% |
| 4 | 1173709 | 2.7% |
| Other values (2) | 2193365 | 5.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 43140072 | 100.0% |
endYear
Text
| Distinct | 97 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 87.7 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 2 |
| Mean length | 2.0238319 |
| Min length | 1 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | \N |
|---|---|
| 2nd row | \N |
| 3rd row | \N |
| 4th row | \N |
| 5th row | \N |
| Value | Count | Frequency (%) |
|---|---|---|
| n | 11360548 | 98.8% |
| 2019 | 7352 | 0.1% |
| 2018 | 7252 | 0.1% |
| 2017 | 7182 | 0.1% |
| 2020 | 7157 | 0.1% |
| 2021 | 7019 | 0.1% |
| 2022 | 6577 | 0.1% |
| 2023 | 6000 | 0.1% |
| 2016 | 5741 | < 0.1% |
| 2024 | 4968 | < 0.1% |
| Other values (87) | 77758 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| \ | 11360548 | 48.8% |
| N | 11360548 | 48.8% |
| 2 | 151769 | 0.7% |
| 0 | 139739 | 0.6% |
| 1 | 97384 | 0.4% |
| 9 | 60023 | 0.3% |
| 8 | 22145 | 0.1% |
| 7 | 18925 | 0.1% |
| 6 | 15490 | 0.1% |
| 3 | 14663 | 0.1% |
| Other values (2) | 27883 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 23269117 | 100.0% |
runtimeMinutes
Text
| Distinct | 958 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 87.7 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 2 |
| Mean length | 1.9859534 |
| Min length | 1 |
Unique
| Unique | 280 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 5 |
| 3rd row | 5 |
| 4th row | 12 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
|---|---|---|
| n | 7844505 | 68.2% |
| 30 | 340204 | 3.0% |
| 60 | 255478 | 2.2% |
| 22 | 198493 | 1.7% |
| 45 | 105243 | 0.9% |
| 15 | 102327 | 0.9% |
| 25 | 82379 | 0.7% |
| 44 | 82274 | 0.7% |
| 23 | 76354 | 0.7% |
| 10 | 76343 | 0.7% |
| Other values (948) | 2333954 | 20.3% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| N | 7844508 | 34.4% |
| \ | 7844505 | 34.4% |
| 2 | 1210172 | 5.3% |
| 0 | 1096535 | 4.8% |
| 1 | 1008872 | 4.4% |
| 3 | 777354 | 3.4% |
| 4 | 768323 | 3.4% |
| 5 | 725104 | 3.2% |
| 6 | 525085 | 2.3% |
| 8 | 371123 | 1.6% |
| Other values (33) | 662025 | 2.9% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 22833606 | 100.0% |
genres
Text
| Distinct | 2385 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 824 |
| Missing (%) | < 0.1% |
| Memory size | 87.7 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 28 |
| Mean length | 10.943875 |
| Min length | 2 |
Unique
| Unique | 213 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Documentary,Short |
|---|---|
| 2nd row | Animation,Short |
| 3rd row | Animation,Comedy,Romance |
| 4th row | Animation,Short |
| 5th row | Short |
| Value | Count | Frequency (%) |
|---|---|---|
| drama | 1298950 | 11.3% |
| comedy | 751748 | 6.5% |
| talk-show | 725249 | 6.3% |
| news | 598679 | 5.2% |
| documentary | 554505 | 4.8% |
| drama,romance | 524304 | 4.6% |
| n | 506043 | 4.4% |
| reality-tv | 365972 | 3.2% |
| adult | 314492 | 2.7% |
| news,talk-show | 260313 | 2.3% |
| Other values (2375) | 5596475 | 48.7% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| a | 13326852 | 10.6% |
| m | 9982927 | 7.9% |
| o | 9576901 | 7.6% |
| r | 8418910 | 6.7% |
| e | 8413216 | 6.7% |
| , | 6840408 | 5.4% |
| y | 5834103 | 4.6% |
| t | 5794351 | 4.6% |
| i | 4832789 | 3.8% |
| n | 4507740 | 3.6% |
| Other values (27) | 48290583 | 38.4% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 125818780 | 100.0% |
Correlations
| | isAdult | titleType |
|---|---|---|
| isAdult | 1.000 | 0.097 |
| titleType | 0.097 | 1.000 |
Missing values
A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
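As a rough illustration of nullity correlation, the sketch below turns each column into a 0/1 "is missing" indicator and computes the Pearson correlation between two indicators. It is a minimal sketch under stated assumptions: the helper names and the toy rows are hypothetical, and this is not necessarily how the report's heatmap is computed.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        # A column that is never (or always) missing has no defined correlation.
        return float("nan")
    return cov / sqrt(var_x * var_y)


def nullity_correlation(rows, col_a, col_b):
    """Correlate the missing/present indicators of two columns."""
    a = [1 if row[col_a] is None else 0 for row in rows]
    b = [1 if row[col_b] is None else 0 for row in rows]
    return pearson(a, b)


# Toy rows (hypothetical values, not taken from the profiled dataset).
rows = [
    {"primaryTitle": "A", "originalTitle": "A", "genres": "Drama"},
    {"primaryTitle": None, "originalTitle": None, "genres": "Short"},
    {"primaryTitle": "C", "originalTitle": "C", "genres": None},
    {"primaryTitle": None, "originalTitle": None, "genres": "News"},
]

same = nullity_correlation(rows, "primaryTitle", "originalTitle")
mixed = nullity_correlation(rows, "primaryTitle", "genres")
print(same, mixed)
```

A value near 1 means two columns tend to be missing in the same rows; a negative value means one tends to be present where the other is missing.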
Sample
| | tconst | titleType | primaryTitle | originalTitle | isAdult | startYear | endYear | runtimeMinutes | genres |
|---|---|---|---|---|---|---|---|---|---|
| 0 | tt0000001 | short | Carmencita | Carmencita | 0 | 1894 | \N | 1 | Documentary,Short |
| 1 | tt0000002 | short | Le clown et ses chiens | Le clown et ses chiens | 0 | 1892 | \N | 5 | Animation,Short |
| 2 | tt0000003 | short | Poor Pierrot | Pauvre Pierrot | 0 | 1892 | \N | 5 | Animation,Comedy,Romance |
| 3 | tt0000004 | short | Un bon bock | Un bon bock | 0 | 1892 | \N | 12 | Animation,Short |
| 4 | tt0000005 | short | Blacksmith Scene | Blacksmith Scene | 0 | 1893 | \N | 1 | Short |
| 5 | tt0000006 | short | Chinese Opium Den | Chinese Opium Den | 0 | 1894 | \N | 1 | Short |
| 6 | tt0000007 | short | Corbett and Courtney Before the Kinetograph | Corbett and Courtney Before the Kinetograph | 0 | 1894 | \N | 1 | Short,Sport |
| 7 | tt0000008 | short | Edison Kinetoscopic Record of a Sneeze | Edison Kinetoscopic Record of a Sneeze | 0 | 1894 | \N | 1 | Documentary,Short |
| 8 | tt0000009 | movie | Miss Jerry | Miss Jerry | 0 | 1894 | \N | 45 | Romance |
| 9 | tt0000010 | short | Leaving the Factory | La sortie de l'usine Lumière à Lyon | 0 | 1895 | \N | 1 | Documentary,Short |
| | tconst | titleType | primaryTitle | originalTitle | isAdult | startYear | endYear | runtimeMinutes | genres |
|---|---|---|---|---|---|---|---|---|---|
| 11497544 | tt9916838 | tvEpisode | Episode #3.13 | Episode #3.13 | 0 | 2009 | \N | \N | Drama |
| 11497545 | tt9916840 | tvEpisode | Horrid Henry's Comic Caper | Horrid Henry's Comic Caper | 0 | 2014 | \N | 11 | Adventure,Animation,Comedy |
| 11497546 | tt9916842 | tvEpisode | Episode #3.16 | Episode #3.16 | 0 | 2009 | \N | \N | Drama |
| 11497547 | tt9916844 | tvEpisode | Episode #3.15 | Episode #3.15 | 0 | 2009 | \N | \N | Drama |
| 11497548 | tt9916846 | tvEpisode | Episode #3.18 | Episode #3.18 | 0 | 2009 | \N | \N | Drama |
| 11497549 | tt9916848 | tvEpisode | Episode #3.17 | Episode #3.17 | 0 | 2009 | \N | \N | Drama |
| 11497550 | tt9916850 | tvEpisode | Episode #3.19 | Episode #3.19 | 0 | 2010 | \N | \N | Drama |
| 11497551 | tt9916852 | tvEpisode | Episode #3.20 | Episode #3.20 | 0 | 2010 | \N | \N | Drama |
| 11497552 | tt9916856 | short | The Wind | The Wind | 0 | 2015 | \N | 27 | Short |
| 11497553 | tt9916880 | tvEpisode | Horrid Henry Knows It All | Horrid Henry Knows It All | 0 | 2014 | \N | 10 | Adventure,Animation,Comedy |
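The sample rows show `\N` in endYear and runtimeMinutes, and the per-variable tables list `\N` characters for startYear, endYear and runtimeMinutes even though those variables are reported with 0 missing values. That pattern suggests `\N` is a missing-value placeholder that was read as ordinary text. The sketch below counts such placeholders per column using only the standard library. It assumes a tab-separated input with a header row, and the function name and in-memory sample are illustrative.

```python
import csv
import io
from collections import Counter

PLACEHOLDER = r"\N"  # literal backslash followed by N, as in the sample rows


def count_placeholders(lines, delimiter="\t"):
    """Count placeholder cells per column in delimited text with a header row."""
    # QUOTE_NONE treats quote characters inside titles as ordinary text.
    # This is an assumption about the source file, not stated in the report.
    reader = csv.DictReader(lines, delimiter=delimiter, quoting=csv.QUOTE_NONE)
    counts = Counter()
    for row in reader:
        for column, value in row.items():
            if value == PLACEHOLDER:
                counts[column] += 1
    return counts


# Three rows taken from the sample tables, reduced to six columns
# and tab-separated for illustration.
sample = io.StringIO(
    "tconst\ttitleType\tstartYear\tendYear\truntimeMinutes\tgenres\n"
    "tt0000001\tshort\t1894\t\\N\t1\tDocumentary,Short\n"
    "tt9916838\ttvEpisode\t2009\t\\N\t\\N\tDrama\n"
    "tt9916856\tshort\t2015\t\\N\t27\tShort\n"
)

counts = count_placeholders(sample)
print(dict(counts))
```

Converting these placeholders to real missing values before profiling would let the Missing counts reflect them and would allow the year and runtime columns to be treated as numeric rather than text.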